VISA - Corpus Annotation with OWL
Authors
Abstract
We present VISA, a graphical annotation tool for OWL-based annotation schemes with a focus on generality and usability.
Similar resources
Use of OWL 2 to Facilitate a Biomedical Knowledge Base Extracted from the GENIA Corpus
The annotation of the GENIA corpus, a set of biomedical articles, targets the classification of biological entities based on their association with a domain-tailored taxonomy of categories. By applying an information extraction process to the corpus, we have developed a knowledge base (KB) that includes a more comprehensive taxonomy of categories, relationships between biological entities, and...
A generic formalism to represent linguistic corpora in RDF and OWL/DL
This paper describes POWLA, a generic formalism to represent linguistic corpora by means of RDF and OWL/DL. Unlike earlier approaches in this direction, POWLA is not tied to a specific selection of annotation layers, but rather, it is designed to support any kind of text-oriented annotation. POWLA inherits its generic character from the underlying data model PAULA (Dipper, 2005; Chiarcos et al....
OWL/DL formalization of the MULTEXT-East morphosyntactic specifications
This paper describes the modeling of the morphosyntactic annotations of the MULTEXT-East corpora and lexicons as an OWL/DL ontology. Formalizing annotation schemes in OWL/DL makes it possible to formally specify interrelationships between the various features and to draw logical inferences from those relationships. We show that this approach provides us with a top-dow...
Subtopic Annotation in a Corpus of News Texts: Steps Towards Automatic Subtopic Segmentation
Subtopic segmentation aims at finding the boundaries among text passages that represent different subtopics, which usually develop a main topic in a text. Being capable of automatically detecting subtopics is very useful for several Natural Language Processing applications. This paper describes subtopic annotation in a corpus of news texts written in Brazilian Portuguese. In particular, we focu...
xGENIA: A comprehensive OWL ontology based on the GENIA corpus
The GENIA ontology is a taxonomy that was developed as a result of manual annotation of a subset of MEDLINE, the GENIA corpus. Both the ontology and corpus have been used as a benchmark to test and develop biological information extraction tools. Recent work shows, however, that there is a demand for a more comprehensive ontology that would go along with the corpus. We propose a comp...